The Size of Message Set Needed for the Optimal Communication Policy
نویسندگان
چکیده
Communication is a key for facilitating multi-agent coordination on cooperative problems. In our previous work, we proposed Signal Learning (SL) and Signal Learning with Messages (SLM) by which agents learn local policies of communication and action simultaneously in MultiAgent Reinforcement Learning (MARL) framework. Our experimental results showed that both SL and SLM can improve the performance of agents’ coordination. In this paper, we focus on theoretical analysis of the conditions for constructing optimal local policies on SL and SLM framework in Decentralized Partially Observable Markov Decision Processes with Communication (Dec-POMDP-Com) models. As main results, we obtain the minimum required sizes of the message set for off-line computation of optimal local policies on SL and SLM. In addition, we report experimental results indicating that the extra messages make some positive effect in learning processes when the size of the message set is larger than the minimum required size based on theoretical analysis.
منابع مشابه
Central Bank Transparency and Monetary Policy Effectiveness
The paper concentrates on the conditions, contingencies and determinants of central bank transparency and communication. From the state of the economy and the quality of national institutions, to the structure of monetary policy committees, the personality of the governor and the nature of the monetary policy framework - with a particular focus on the case of inflation targeting, there i...
متن کاملSocial Value of Information and Optimal Communication Policy of Central Banks
Monetary policy as a tool for expectations management is believed to be most effective if it can coordinate the beliefs and expectations of the economic agents. The optimal communication policy is in an environment where central bank announcements are common knowledge and abundant information is complete transparency. The above conclusion is altered in the more realistic situation where economi...
متن کاملWhistle Blowing: A Message to Leaders and Managers; Comment on “Cultures of Silence and Cultures of Voice: The Role of Whistleblowing in Healthcare Organizations”
This comment argues that instead of worrying about the pros and cons of whistleblowing one should focus on the more general problem of the failure of upward communication around safety and quality problems and consider what leaders and managers must do to stimulate subordinates to communicate and reward such communication. The article analyzes why safety failures occur and introduces the concep...
متن کاملAn Integrated Production-Inventory Model with Backorder and Lot for Lot Policy
inventory model, backorder buyer , vendor, lot for lot policy In this paper, an inventory model for two-stage supply chain is investigated. A supply chain with single vendor and single buyer is considered. We assume that shortage as a backorder is allowed for the buyer and the vendor makes the production set up every time the buyer places an order and supplies on a lot for lot basis...
متن کاملStatistical Study to Find the Optimal Dose of the First Phase in Leukemia Patients
Abstract. In this paper, a structured framework for determining the optimal dose of the first phase in leukemia patients is presented. And here we describe and discuss the key parameters that are needed to set up and run a CRM trial. These are: number of doses; maximum tolerable dose; Target Toxicity Level Dose-toxicity model; Dose-toxicity skeleton; Sample size and cohort size and Stopping ru...
متن کامل